70 research outputs found

    Functional protein divergence in the evolution of Homo sapiens.

    Get PDF
    BACKGROUND: Protein-coding regions in a genome evolve by sequence divergence and gene gain and loss, altering the gene content of the organism. However, it is not well understood how this has given rise to the enormous diversity of metazoa present today. RESULTS: To obtain a global view of human genomic evolution, we quantify the divergence of proteins by functional category at different evolutionary distances from human. CONCLUSION: This analysis highlights some general systems-level characteristics of human evolution: regulatory processes, such as signal transducers, transcription factors and receptors, have a high degree of plasticity, while core processes, such as metabolism, transport and protein synthesis, are largely conserved. Additionally, this study reveals a dynamic picture of selective forces at short, medium and long evolutionary timescales. Certain functional categories, such as 'development' and 'organogenesis', exhibit temporal patterns of sequence divergence in eukaryotes relative to human. This framework for a grammar of human evolution supports previously postulated theories of robustness and evolvability.RIGHTS : This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are

    Patterns of evolutionary constraints on genes in humans

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Different regions in a genome evolve at different rates depending on structural and functional constraints. Some genomic regions are highly conserved during metazoan evolution, while other regions may evolve rapidly, either in all species or in a lineage-specific manner. A strong or even moderate change in constraints in functional regions, for example in coding regions, can have significant evolutionary consequences.</p> <p>Results</p> <p>Here we discuss a novel framework, 'BaseDiver', to classify groups of genes in humans based on the patterns of evolutionary constraints on polymorphic positions in their coding regions. Comparing the nucleotide-level divergence among mammals with the extent of deviation from the ancestral base in the human lineage, we identify patterns of evolutionary pressure on nonsynonymous base-positions in groups of genes belonging to the same functional category. Focussing on groups of genes in functional categories, we find that transcription factors contain a significant excess of nonsynonymous base-positions that are conserved in other mammals but changed in human, while immunity related genes harbour mutations at base-positions that evolve rapidly in all mammals including humans due to strong preference for advantageous alleles. Genes involved in olfaction also evolve rapidly in all mammals, and in humans this appears to be due to weak negative selection.</p> <p>Conclusion</p> <p>While recent studies have identified genes under positive selection in humans, our approach identifies evolutionary constraints on Gene Ontology groups identifying changes in humans relative to some of the other mammals.</p

    Prioritization of candidate cancer genes—an aid to oncogenomic studies

    Get PDF
    The development of techniques for oncogenomic analyses such as array comparative genomic hybridization, messenger RNA expression arrays and mutational screens have come to the fore in modern cancer research. Studies utilizing these techniques are able to highlight panels of genes that are altered in cancer. However, these candidate cancer genes must then be scrutinized to reveal whether they contribute to oncogenesis or are coincidental and non-causative. We present a computational method for the prioritization of candidate (i) proto-oncogenes and (ii) tumour suppressor genes from oncogenomic experiments. We constructed computational classifiers using different combinations of sequence and functional data including sequence conservation, protein domains and interactions, and regulatory data. We found that these classifiers are able to distinguish between known cancer genes and other human genes. Furthermore, the classifiers also discriminate candidate cancer genes from a recent mutational screen from other human genes. We provide a web-based facility through which cancer biologists may access our results and we propose computational cancer gene classification as a useful method of prioritizing candidate cancer genes identified in oncogenomic studies

    Co-Regulation of Histone-Modifying Enzymes in Cancer

    Get PDF
    Cancer is characterized by aberrant patterns of expression of multiple genes. These major shifts in gene expression are believed to be due to not only genetic but also epigenetic changes. The epigenetic changes are communicated through chemical modifications, including histone modifications. However, it is unclear whether the binding of histone-modifying proteins to genomic regions and the placing of histone modifications efficiently discriminates corresponding genes from the rest of the genes in the human genome. We performed gene expression analysis of histone demethylases (HDMs) and histone methyltransferases (HMTs), their target genes and genes with relevant histone modifications in normal and tumor tissues. Surprisingly, this analysis revealed the existence of correlations in the expression levels of different HDMs and HMTs. The observed HDM/HMT gene expression signature was specific to particular normal and cancer cell types and highly correlated with target gene expression and the expression of genes with histone modifications. Notably, we observed that trimethylation at lysine 4 and lysine 27 separated preferentially expressed and underexpressed genes, which was strikingly different in cancer cells compared to normal cells. We conclude that changes in coordinated regulation of enzymes executing histone modifications may underlie global epigenetic changes occurring in cancer

    Genome-wide Analysis using ChIP to Identify Isoform-specific Gene Targets

    Get PDF
    Recruitment of transcriptional and epigenetic factors to their targets is a key step in their regulation. Prominently featured in recruitment are the protein domains that bind to specific histone modifications. One such domain is the plant homeodomain (PHD), found in several chromatin-binding proteins. The epigenetic factor RBP2 has multiple PHD domains, however, they have different functions (Figure 4). In particular, the C-terminal PHD domain, found in a RBP2 oncogenic fusion in human leukemia, binds to trimethylated lysine 4 in histone H3 (H3K4me3)1. The transcript corresponding to the RBP2 isoform containing the C-terminal PHD accumulates during differentiation of promonocytic, lymphoma-derived, U937 cells into monocytes2. Consistent with both sets of data, genome-wide analysis showed that in differentiated U937 cells, the RBP2 protein gets localized to genomic regions highly enriched for H3K4me33. Localization of RBP2 to its targets correlates with a decrease in H3K4me3 due to RBP2 histone demethylase activity and a decrease in transcriptional activity. In contrast, two other PHDs of RBP2 are unable to bind H3K4me3. Notably, the C-terminal domain PHD of RBP2 is absent in the smaller RBP2 isoform4. It is conceivable that the small isoform of RBP2, which lacks interaction with H3K4me3, differs from the larger isoform in genomic location. The difference in genomic location of RBP2 isoforms may account for the observed diversity in RBP2 function. Specifically, RBP2 is a critical player in cellular differentiation mediated by the retinoblastoma protein (pRB). Consistent with these data, previous genome-wide analysis, without distinction between isoforms, identified two distinct groups of RBP2 target genes: 1) genes bound by RBP2 in a manner that is independent of differentiation; 2) genes bound by RBP2 in a differentiation-dependent manner

    Whole genome analysis of p38 SAPK-mediated gene expression upon stress

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Cells have the ability to respond and adapt to environmental changes through activation of stress-activated protein kinases (SAPKs). Although p38 SAPK signalling is known to participate in the regulation of gene expression little is known on the molecular mechanisms used by this SAPK to regulate stress-responsive genes and the overall set of genes regulated by p38 in response to different stimuli.</p> <p>Results</p> <p>Here, we report a whole genome expression analyses on mouse embryonic fibroblasts (MEFs) treated with three different p38 SAPK activating-stimuli, namely osmostress, the cytokine TNFα and the protein synthesis inhibitor anisomycin. We have found that the activation kinetics of p38α SAPK in response to these insults is different and also leads to a complex gene pattern response specific for a given stress with a restricted set of overlapping genes. In addition, we have analysed the contribution of p38α the major p38 family member present in MEFs, to the overall stress-induced transcriptional response by using both a chemical inhibitor (SB203580) and p38α deficient (p38α<sup>-/-</sup>) MEFs. We show here that p38 SAPK dependency ranged between 60% and 88% depending on the treatments and that there is a very good overlap between the inhibitor treatment and the ko cells. Furthermore, we have found that the dependency of SAPK varies depending on the time the cells are subjected to osmostress.</p> <p>Conclusions</p> <p>Our genome-wide transcriptional analyses shows a selective response to specific stimuli and a restricted common response of up to 20% of the stress up-regulated early genes that involves an important set of transcription factors, which might be critical for either cell adaptation or preparation for continuous extra-cellular changes. Interestingly, up to 85% of the up-regulated genes are under the transcriptional control of p38 SAPK. Thus, activation of p38 SAPK is critical to elicit the early gene expression program required for cell adaptation to stress.</p

    Mutational signature dynamics shaping the evolution of oesophageal adenocarcinoma

    Get PDF
    A variety of mutational processes drive cancer development, but their dynamics across the entire disease spectrum from pre-cancerous to advanced neoplasia are poorly understood. We explore the mutagenic processes shaping oesophageal adenocarcinoma tumorigenesis in 997 instances comprising distinct stages of this malignancy, from Barrett Oesophagus to primary tumours and advanced metastatic disease. The mutational landscape is dominated by the C[T > C/G]T substitution enriched signatures SBS17a/b, which are linked with TP53 mutations, increased proliferation, genomic instability and disease progression. The APOBEC mutagenesis signature is a weak but persistent signal amplified in primary tumours. We also identify prevalent alterations in DNA damage repair pathways, with homologous recombination, base and nucleotide excision repair and translesion synthesis mutated in up to 50% of the cohort, and surprisingly uncoupled from transcriptional activity. Among these, the presence of base excision repair deficiencies show remarkably poor prognosis in the cohort. In this work, we provide insights on the mutational aetiology and changes enabling the transition from pre-neoplastic to advanced oesophageal adenocarcinoma

    Gitools: Analysis and Visualisation of Genomic Data Using Interactive Heat-Maps

    Get PDF
    Intuitive visualization of data and results is very important in genomics, especially when many conditions are to be analyzed and compared. Heat-maps have proven very useful for the representation of biological data. Here we present Gitools (http://www.gitools.org), an open-source tool to perform analyses and visualize data and results as interactive heat-maps. Gitools contains data import systems from several sources (i.e. IntOGen, Biomart, KEGG, Gene Ontology), which facilitate the integration of novel data with previous knowledge

    Computational disease gene identification: a concert of methods prioritizes type 2 diabetes and obesity candidate genes

    Get PDF
    Genome-wide experimental methods to identify disease genes, such as linkage analysis and association studies, generate increasingly large candidate gene sets for which comprehensive empirical analysis is impractical. Computational methods employ data from a variety of sources to identify the most likely candidate disease genes from these gene sets. Here, we review seven independent computational disease gene prioritization methods, and then apply them in concert to the analysis of 9556 positional candidate genes for type 2 diabetes (T2D) and the related trait obesity. We generate and analyse a list of nine primary candidate genes for T2D genes and five for obesity. Two genes, LPL and BCKDHA, are common to these two sets. We also present a set of secondary candidates for T2D (94 genes) and for obesity (116 genes) with 58 genes in common to both diseases
    corecore